Hierarchical Semantic Classification: Word Sense Disambiguation with World Knowledge

نویسندگان

  • Massimiliano Ciaramita
  • Thomas Hofmann
  • Mark Johnson
چکیده

We present a learning architecture for lexical semantic classification problems that supplements task-specific training data with background data encoding general “world knowledge”. The model compiles knowledge contained in a dictionaryontology into additional training data, and integrates task-specific and background data through a novel hierarchical learning architecture. Experiments on a word sense disambiguation task provide empirical evidence that this “hierarchical classifier” outperforms a state-of-the-art standard “flat” one.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wikipedia-based Compact Hierarchical Semantics for Natural Language Processing

A correct semantic representation of words and texts underlies many text processing tasks such as text categorization, word sense disambiguation, and semantic relatedness assessment. It has long been recognized that computers require access to common-sense and domain-specific world knowledge in order to process textual data at a deeper level. In this paper, we present a novel representation of ...

متن کامل

Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification

The introduction of hierarchical thesauri (HT) that contain significant semantic information, has led researchers to investigate their potential for improving performance of the text classification task, extending the traditional “bag of words” representation, incorporating syntactic and semantic relationships among words. In this paper we address this problem by proposing a Word Sense Disambig...

متن کامل

Making Explicit the Hidden Semantics of Hierarchical Classifications

Hierarchical classifications are concept hierarchies used to organize large amounts of documents. File systems, products’ taxonomies for the market place and the directories provided by Web portals are common examples of hierarchical classifications. As semi-structured knowledge sources, hierarchical classifications have peculiar features: they differ both from plain texts since they are based ...

متن کامل

Path-Based Semantic Relatedness on Linked Data and Its Use to Word and Entity Disambiguation

Semantic relatedness and disambiguation are fundamental problems for linking text documents to the Web of Data. There are many approaches dealing with both problems but most of them rely on word or concept distribution over Wikipedia. They are therefore not applicable to concepts that do not have a rich textual description. In this paper, we show that semantic relatedness can also be accurately...

متن کامل

Combining Independent Knowledge Sources for Word Sense Disambiguation

Disambiguation Yorick Wilks and Mark Stevenson Department of Computer Science, University of She eld, Regent Court, 211 Portobello Street, She eld S1 4DP, UK fyorick, [email protected] Abstract Sense tagging, the automatic assignment of the appropriate sense from some lexicon to each of the words in a text, is a specialised instance of the general problem of word sense disambiguation. We di...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003